Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 5020 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 784.4 KiB |
| Average record size in memory | 160.0 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 10 |
| DateTime | 1 |
| Categorical | 7 |
Price_x is highly overall correlated with Price_y and 2 other fields | High correlation |
Qty is highly overall correlated with TotalAmount | High correlation |
TotalAmount is highly overall correlated with Qty | High correlation |
StoreID is highly overall correlated with Latitude and 3 other fields | High correlation |
Latitude is highly overall correlated with StoreID and 3 other fields | High correlation |
Longitude is highly overall correlated with StoreName and 2 other fields | High correlation |
Price_y is highly overall correlated with Price_x and 2 other fields | High correlation |
Age is highly overall correlated with Income and 1 other fields | High correlation |
Income is highly overall correlated with Age | High correlation |
ProductID is highly overall correlated with Price_x and 2 other fields | High correlation |
StoreName is highly overall correlated with StoreID and 4 other fields | High correlation |
GroupStore is highly overall correlated with StoreID and 4 other fields | High correlation |
Type is highly overall correlated with StoreID and 4 other fields | High correlation |
Product Name is highly overall correlated with Price_x and 2 other fields | High correlation |
Marital Status is highly overall correlated with Age | High correlation |
Income has 185 (3.7%) zeros | Zeros |
Reproduction
| Analysis started | 2023-09-15 09:47:51.509929 |
|---|---|
| Analysis finished | 2023-09-15 09:48:33.003276 |
| Duration | 41.49 seconds |
| Software version | ydata-profiling vv4.3.1 |
| Download configuration | config.json |
TransactionID
Text
| Distinct | 4908 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.8822709 |
| Min length | 4 |
Characters and Unicode
| Total characters | 34549 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4797 ? |
|---|---|
| Unique (%) | 95.6% |
Sample
| 1st row | TR11369 |
|---|---|
| 2nd row | TR16356 |
| 3rd row | TR1984 |
| 4th row | TR35256 |
| 5th row | TR41231 |
| Value | Count | Frequency (%) |
| tr71313 | 3 | 0.1% |
| tr57126 | 2 | < 0.1% |
| tr88968 | 2 | < 0.1% |
| tr72611 | 2 | < 0.1% |
| tr6940 | 2 | < 0.1% |
| tr78366 | 2 | < 0.1% |
| tr51183 | 2 | < 0.1% |
| tr61742 | 2 | < 0.1% |
| tr33585 | 2 | < 0.1% |
| tr13665 | 2 | < 0.1% |
| Other values (4898) | 4999 |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 5020 | |
| R | 5020 | |
| 2 | 2572 | |
| 4 | 2557 | |
| 9 | 2521 | |
| 7 | 2515 | |
| 8 | 2511 | |
| 3 | 2500 | |
| 1 | 2497 | |
| 6 | 2457 | |
| Other values (2) | 4379 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 24509 | |
| Uppercase Letter | 10040 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2572 | |
| 4 | 2557 | |
| 9 | 2521 | |
| 7 | 2515 | |
| 8 | 2511 | |
| 3 | 2500 | |
| 1 | 2497 | |
| 6 | 2457 | |
| 5 | 2447 | |
| 0 | 1932 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 5020 | |
| R | 5020 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24509 | |
| Latin | 10040 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2572 | |
| 4 | 2557 | |
| 9 | 2521 | |
| 7 | 2515 | |
| 8 | 2511 | |
| 3 | 2500 | |
| 1 | 2497 | |
| 6 | 2457 | |
| 5 | 2447 | |
| 0 | 1932 |
Latin
| Value | Count | Frequency (%) |
| T | 5020 | |
| R | 5020 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34549 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 5020 | |
| R | 5020 | |
| 2 | 2572 | |
| 4 | 2557 | |
| 9 | 2521 | |
| 7 | 2515 | |
| 8 | 2511 | |
| 3 | 2500 | |
| 1 | 2497 | |
| 6 | 2457 | |
| Other values (2) | 4379 |
CustomerID
Real number (ℝ)
| Distinct | 447 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 221.26375 |
| Minimum | 1 |
|---|---|
| Maximum | 447 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 108 |
| median | 221 |
| Q3 | 332 |
| 95-th percentile | 426 |
| Maximum | 447 |
| Range | 446 |
| Interquartile range (IQR) | 224 |
Descriptive statistics
| Standard deviation | 129.67296 |
|---|---|
| Coefficient of variation (CV) | 0.58605604 |
| Kurtosis | -1.1934547 |
| Mean | 221.26375 |
| Median Absolute Deviation (MAD) | 112 |
| Skewness | 0.022381467 |
| Sum | 1110744 |
| Variance | 16815.075 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 156 | 21 | 0.4% |
| 365 | 20 | 0.4% |
| 392 | 20 | 0.4% |
| 89 | 19 | 0.4% |
| 44 | 19 | 0.4% |
| 245 | 19 | 0.4% |
| 13 | 19 | 0.4% |
| 445 | 18 | 0.4% |
| 444 | 18 | 0.4% |
| 189 | 18 | 0.4% |
| Other values (437) | 4829 |
| Value | Count | Frequency (%) |
| 1 | 17 | |
| 2 | 13 | |
| 3 | 15 | |
| 4 | 10 | |
| 5 | 7 | |
| 6 | 10 | |
| 7 | 17 | |
| 8 | 14 | |
| 9 | 10 | |
| 10 | 14 |
| Value | Count | Frequency (%) |
| 447 | 13 | |
| 446 | 11 | |
| 445 | 18 | |
| 444 | 18 | |
| 443 | 16 | |
| 442 | 13 | |
| 441 | 5 | 0.1% |
| 440 | 12 | |
| 439 | 7 | 0.1% |
| 438 | 14 |
Date
Date
| Distinct | 365 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| Minimum | 2022-01-01 00:00:00 |
|---|---|
| Maximum | 2022-12-31 00:00:00 |
ProductID
Categorical
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| P5 | |
|---|---|
| P10 | |
| P2 | |
| P7 | |
| P3 | |
| Other values (5) |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.123506 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10660 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P3 |
|---|---|
| 2nd row | P9 |
| 3rd row | P1 |
| 4th row | P1 |
| 5th row | P9 |
Common Values
| Value | Count | Frequency (%) |
| P5 | 814 | |
| P10 | 620 | |
| P2 | 530 | |
| P7 | 522 | |
| P3 | 519 | |
| P9 | 488 | |
| P8 | 485 | |
| P1 | 397 | |
| P4 | 390 | |
| P6 | 255 | 5.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| p5 | 814 | |
| p10 | 620 | |
| p2 | 530 | |
| p7 | 522 | |
| p3 | 519 | |
| p9 | 488 | |
| p8 | 485 | |
| p1 | 397 | |
| p4 | 390 | |
| p6 | 255 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 5020 | |
| 1 | 1017 | 9.5% |
| 5 | 814 | 7.6% |
| 0 | 620 | 5.8% |
| 2 | 530 | 5.0% |
| 7 | 522 | 4.9% |
| 3 | 519 | 4.9% |
| 9 | 488 | 4.6% |
| 8 | 485 | 4.5% |
| 4 | 390 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5640 | |
| Uppercase Letter | 5020 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1017 | |
| 5 | 814 | |
| 0 | 620 | |
| 2 | 530 | |
| 7 | 522 | |
| 3 | 519 | |
| 9 | 488 | |
| 8 | 485 | |
| 4 | 390 | 6.9% |
| 6 | 255 | 4.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 5020 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5640 | |
| Latin | 5020 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1017 | |
| 5 | 814 | |
| 0 | 620 | |
| 2 | 530 | |
| 7 | 522 | |
| 3 | 519 | |
| 9 | 488 | |
| 8 | 485 | |
| 4 | 390 | 6.9% |
| 6 | 255 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| P | 5020 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10660 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 5020 | |
| 1 | 1017 | 9.5% |
| 5 | 814 | 7.6% |
| 0 | 620 | 5.8% |
| 2 | 530 | 5.0% |
| 7 | 522 | 4.9% |
| 3 | 519 | 4.9% |
| 9 | 488 | 4.6% |
| 8 | 485 | 4.5% |
| 4 | 390 | 3.7% |
Price_x
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9684.8008 |
| Minimum | 3200 |
|---|---|
| Maximum | 18000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 3200 |
|---|---|
| 5-th percentile | 3200 |
| Q1 | 4200 |
| median | 9400 |
| Q3 | 15000 |
| 95-th percentile | 18000 |
| Maximum | 18000 |
| Range | 14800 |
| Interquartile range (IQR) | 10800 |
Descriptive statistics
| Standard deviation | 4600.7088 |
|---|---|
| Coefficient of variation (CV) | 0.47504423 |
| Kurtosis | -1.1395182 |
| Mean | 9684.8008 |
| Median Absolute Deviation (MAD) | 5200 |
| Skewness | 0.16819672 |
| Sum | 48617700 |
| Variance | 21166521 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4200 | 814 | |
| 15000 | 620 | |
| 3200 | 530 | |
| 9400 | 522 | |
| 7500 | 519 | |
| 10000 | 488 | |
| 16000 | 485 | |
| 8800 | 397 | |
| 12000 | 390 | |
| 18000 | 255 | 5.1% |
| Value | Count | Frequency (%) |
| 3200 | 530 | |
| 4200 | 814 | |
| 7500 | 519 | |
| 8800 | 397 | |
| 9400 | 522 | |
| 10000 | 488 | |
| 12000 | 390 | |
| 15000 | 620 | |
| 16000 | 485 | |
| 18000 | 255 | 5.1% |
| Value | Count | Frequency (%) |
| 18000 | 255 | 5.1% |
| 16000 | 485 | |
| 15000 | 620 | |
| 12000 | 390 | |
| 10000 | 488 | |
| 9400 | 522 | |
| 8800 | 397 | |
| 7500 | 519 | |
| 4200 | 814 | |
| 3200 | 530 |
Qty
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6446215 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.8552954 |
|---|---|
| Coefficient of variation (CV) | 0.50905022 |
| Kurtosis | 0.39205731 |
| Mean | 3.6446215 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.67510385 |
| Sum | 18296 |
| Variance | 3.4421209 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1078 | |
| 2 | 915 | |
| 4 | 869 | |
| 5 | 802 | |
| 1 | 601 | |
| 6 | 402 | 8.0% |
| 7 | 218 | 4.3% |
| 8 | 52 | 1.0% |
| 10 | 44 | 0.9% |
| 9 | 39 | 0.8% |
| Value | Count | Frequency (%) |
| 1 | 601 | |
| 2 | 915 | |
| 3 | 1078 | |
| 4 | 869 | |
| 5 | 802 | |
| 6 | 402 | 8.0% |
| 7 | 218 | 4.3% |
| 8 | 52 | 1.0% |
| 9 | 39 | 0.8% |
| 10 | 44 | 0.9% |
| Value | Count | Frequency (%) |
| 10 | 44 | 0.9% |
| 9 | 39 | 0.8% |
| 8 | 52 | 1.0% |
| 7 | 218 | 4.3% |
| 6 | 402 | 8.0% |
| 5 | 802 | |
| 4 | 869 | |
| 3 | 1078 | |
| 2 | 915 | |
| 1 | 601 |
TotalAmount
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 44 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32279.482 |
| Minimum | 7500 |
|---|---|
| Maximum | 88000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 7500 |
|---|---|
| 5-th percentile | 8400 |
| Q1 | 16000 |
| median | 28200 |
| Q3 | 47000 |
| 95-th percentile | 72000 |
| Maximum | 88000 |
| Range | 80500 |
| Interquartile range (IQR) | 31000 |
Descriptive statistics
| Standard deviation | 19675.462 |
|---|---|
| Coefficient of variation (CV) | 0.60953464 |
| Kurtosis | -0.32653276 |
| Mean | 32279.482 |
| Median Absolute Deviation (MAD) | 13200 |
| Skewness | 0.78934371 |
| Sum | 1.62043 × 108 |
| Variance | 3.8712382 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30000 | 313 | 6.2% |
| 16000 | 253 | 5.0% |
| 12600 | 238 | 4.7% |
| 60000 | 237 | 4.7% |
| 15000 | 224 | 4.5% |
| 48000 | 217 | 4.3% |
| 16800 | 198 | 3.9% |
| 21000 | 197 | 3.9% |
| 45000 | 185 | 3.7% |
| 8400 | 181 | 3.6% |
| Other values (34) | 2777 |
| Value | Count | Frequency (%) |
| 7500 | 80 | 1.6% |
| 8400 | 181 | |
| 9600 | 115 | |
| 10000 | 64 | 1.3% |
| 12000 | 89 | 1.8% |
| 12600 | 238 | |
| 12800 | 107 | |
| 15000 | 224 | |
| 16000 | 253 | |
| 16800 | 198 |
| Value | Count | Frequency (%) |
| 88000 | 44 | 0.9% |
| 79200 | 39 | 0.8% |
| 75000 | 139 | |
| 72000 | 58 | 1.2% |
| 70400 | 52 | 1.0% |
| 70000 | 74 | 1.5% |
| 61600 | 47 | 0.9% |
| 60000 | 237 | |
| 56400 | 96 | |
| 54000 | 67 | 1.3% |
StoreID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4898406 |
| Minimum | 1 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 11 |
| 95-th percentile | 14 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.0285018 |
|---|---|
| Coefficient of variation (CV) | 0.53786215 |
| Kurtosis | -1.2171416 |
| Mean | 7.4898406 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.0021242975 |
| Sum | 37599 |
| Variance | 16.228827 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 370 | 7.4% |
| 13 | 368 | 7.3% |
| 6 | 368 | 7.3% |
| 3 | 367 | 7.3% |
| 2 | 364 | 7.3% |
| 12 | 363 | 7.2% |
| 5 | 362 | 7.2% |
| 10 | 355 | 7.1% |
| 7 | 355 | 7.1% |
| 11 | 355 | 7.1% |
| Other values (4) | 1393 |
| Value | Count | Frequency (%) |
| 1 | 354 | |
| 2 | 364 | |
| 3 | 367 | |
| 4 | 350 | |
| 5 | 362 | |
| 6 | 368 | |
| 7 | 355 | |
| 8 | 343 | |
| 9 | 370 | |
| 10 | 355 |
| Value | Count | Frequency (%) |
| 14 | 346 | |
| 13 | 368 | |
| 12 | 363 | |
| 11 | 355 | |
| 10 | 355 | |
| 9 | 370 | |
| 8 | 343 | |
| 7 | 355 | |
| 6 | 368 | |
| 5 | 362 |
StoreName
Categorical
HIGH CORRELATION 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| Lingga | |
|---|---|
| Sinar Harapan | |
| Buana | |
| Prima Kota | |
| Prima Kelapa Dua | |
| Other values (7) |
Length
| Max length | 16 |
|---|---|
| Median length | 12 |
| Mean length | 10.326096 |
| Min length | 5 |
Characters and Unicode
| Total characters | 51837 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Prestasi Utama |
|---|---|
| 2nd row | Prima Tendean |
| 3rd row | Gita Ginara |
| 4th row | Gita Ginara |
| 5th row | Gita Ginara |
Common Values
| Value | Count | Frequency (%) |
| Lingga | 738 | |
| Sinar Harapan | 698 | |
| Buana | 368 | |
| Prima Kota | 367 | |
| Prima Kelapa Dua | 364 | |
| Prestasi Utama | 363 | |
| Bonafid | 362 | |
| Harapan Baru | 355 | |
| Buana Indah | 355 | |
| Prima Tendean | 354 | |
| Other values (2) | 696 |
Length
| Value | Count | Frequency (%) |
| prima | 1085 | |
| harapan | 1053 | |
| lingga | 738 | 8.6% |
| buana | 723 | 8.4% |
| sinar | 698 | 8.1% |
| kota | 367 | 4.3% |
| kelapa | 364 | 4.2% |
| dua | 364 | 4.2% |
| utama | 363 | 4.2% |
| prestasi | 363 | 4.2% |
| Other values (7) | 2472 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12842 | |
| n | 5679 | |
| i | 4292 | 8.3% |
| r | 4250 | 8.2% |
| 3570 | 6.9% | |
| g | 1822 | 3.5% |
| P | 1794 | 3.5% |
| m | 1448 | 2.8% |
| t | 1443 | 2.8% |
| u | 1442 | 2.8% |
| Other values (18) | 13255 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39677 | |
| Uppercase Letter | 8590 | 16.6% |
| Space Separator | 3570 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12842 | |
| n | 5679 | |
| i | 4292 | 10.8% |
| r | 4250 | 10.7% |
| g | 1822 | 4.6% |
| m | 1448 | 3.6% |
| t | 1443 | 3.6% |
| u | 1442 | 3.6% |
| e | 1435 | 3.6% |
| p | 1417 | 3.6% |
| Other values (6) | 3607 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1794 | |
| B | 1440 | |
| H | 1053 | |
| L | 738 | |
| K | 731 | |
| G | 700 | 8.1% |
| S | 698 | 8.1% |
| D | 364 | 4.2% |
| U | 363 | 4.2% |
| I | 355 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3570 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48267 | |
| Common | 3570 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12842 | |
| n | 5679 | |
| i | 4292 | 8.9% |
| r | 4250 | 8.8% |
| g | 1822 | 3.8% |
| P | 1794 | 3.7% |
| m | 1448 | 3.0% |
| t | 1443 | 3.0% |
| u | 1442 | 3.0% |
| B | 1440 | 3.0% |
| Other values (17) | 11815 |
Common
| Value | Count | Frequency (%) |
| 3570 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12842 | |
| n | 5679 | |
| i | 4292 | 8.3% |
| r | 4250 | 8.2% |
| 3570 | 6.9% | |
| g | 1822 | 3.5% |
| P | 1794 | 3.5% |
| m | 1448 | 2.8% |
| t | 1443 | 2.8% |
| u | 1442 | 2.8% |
| Other values (18) | 13255 |
GroupStore
Categorical
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| Prima | |
|---|---|
| Lingga | |
| Buana | |
| Prestasi | |
| Gita | |
| Other values (2) |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 6.6143426 |
| Min length | 4 |
Characters and Unicode
| Total characters | 33204 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Prestasi |
|---|---|
| 2nd row | Prima |
| 3rd row | Gita |
| 4th row | Gita |
| 5th row | Gita |
Common Values
| Value | Count | Frequency (%) |
| Prima | 1085 | |
| Lingga | 738 | |
| Buana | 723 | |
| Prestasi | 718 | |
| Gita | 712 | |
| Harapan Baru | 698 | |
| Priangan | 346 | 6.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| prima | 1085 | |
| lingga | 738 | |
| buana | 723 | |
| prestasi | 718 | |
| gita | 712 | |
| harapan | 698 | |
| baru | 698 | |
| priangan | 346 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8183 | |
| i | 3599 | |
| r | 3545 | |
| n | 2851 | 8.6% |
| P | 2149 | 6.5% |
| g | 1822 | 5.5% |
| s | 1436 | 4.3% |
| t | 1430 | 4.3% |
| u | 1421 | 4.3% |
| B | 1421 | 4.3% |
| Other values (7) | 5347 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26788 | |
| Uppercase Letter | 5718 | 17.2% |
| Space Separator | 698 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8183 | |
| i | 3599 | |
| r | 3545 | |
| n | 2851 | 10.6% |
| g | 1822 | 6.8% |
| s | 1436 | 5.4% |
| t | 1430 | 5.3% |
| u | 1421 | 5.3% |
| m | 1085 | 4.1% |
| e | 718 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2149 | |
| B | 1421 | |
| L | 738 | 12.9% |
| G | 712 | 12.5% |
| H | 698 | 12.2% |
Space Separator
| Value | Count | Frequency (%) |
| 698 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32506 | |
| Common | 698 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8183 | |
| i | 3599 | |
| r | 3545 | |
| n | 2851 | 8.8% |
| P | 2149 | 6.6% |
| g | 1822 | 5.6% |
| s | 1436 | 4.4% |
| t | 1430 | 4.4% |
| u | 1421 | 4.4% |
| B | 1421 | 4.4% |
| Other values (6) | 4649 |
Common
| Value | Count | Frequency (%) |
| 698 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33204 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8183 | |
| i | 3599 | |
| r | 3545 | |
| n | 2851 | 8.6% |
| P | 2149 | 6.5% |
| g | 1822 | 5.5% |
| s | 1436 | 4.3% |
| t | 1430 | 4.3% |
| u | 1421 | 4.3% |
| B | 1421 | 4.3% |
| Other values (7) | 5347 |
Type
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| General Trade | |
|---|---|
| Modern Trade |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.567928 |
| Min length | 12 |
Characters and Unicode
| Total characters | 63091 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | General Trade |
|---|---|
| 2nd row | Modern Trade |
| 3rd row | General Trade |
| 4th row | General Trade |
| 5th row | General Trade |
Common Values
| Value | Count | Frequency (%) |
| General Trade | 2851 | |
| Modern Trade | 2169 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| trade | 5020 | |
| general | 2851 | |
| modern | 2169 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 12891 | |
| r | 10040 | |
| a | 7871 | |
| d | 7189 | |
| n | 5020 | 8.0% |
| 5020 | 8.0% | |
| T | 5020 | 8.0% |
| G | 2851 | 4.5% |
| l | 2851 | 4.5% |
| M | 2169 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48031 | |
| Uppercase Letter | 10040 | 15.9% |
| Space Separator | 5020 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12891 | |
| r | 10040 | |
| a | 7871 | |
| d | 7189 | |
| n | 5020 | 10.5% |
| l | 2851 | 5.9% |
| o | 2169 | 4.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 5020 | |
| G | 2851 | |
| M | 2169 |
Space Separator
| Value | Count | Frequency (%) |
| 5020 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 58071 | |
| Common | 5020 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 12891 | |
| r | 10040 | |
| a | 7871 | |
| d | 7189 | |
| n | 5020 | 8.6% |
| T | 5020 | 8.6% |
| G | 2851 | 4.9% |
| l | 2851 | 4.9% |
| M | 2169 | 3.7% |
| o | 2169 | 3.7% |
Common
| Value | Count | Frequency (%) |
| 5020 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63091 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 12891 | |
| r | 10040 | |
| a | 7871 | |
| d | 7189 | |
| n | 5020 | 8.0% |
| 5020 | 8.0% | |
| T | 5020 | 8.0% |
| G | 2851 | 4.5% |
| l | 2851 | 4.5% |
| M | 2169 | 3.4% |
Latitude
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -2.9422362 |
| Minimum | -7.797068 |
|---|---|
| Maximum | 5.54829 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3612 |
| Negative (%) | 72.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | -7.797068 |
|---|---|
| 5-th percentile | -7.797068 |
| Q1 | -6.914864 |
| median | -5.135399 |
| Q3 | 0.533505 |
| 95-th percentile | 5.54829 |
| Maximum | 5.54829 |
| Range | 13.345358 |
| Interquartile range (IQR) | 7.448369 |
Descriptive statistics
| Standard deviation | 4.323225 |
|---|---|
| Coefficient of variation (CV) | -1.4693671 |
| Kurtosis | -0.94353879 |
| Mean | -2.9422362 |
| Median Absolute Deviation (MAD) | 2.115046 |
| Skewness | 0.67736993 |
| Sum | -14770.026 |
| Variance | 18.690274 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -3.654703 | 370 | 7.4% |
| -1.26916 | 368 | 7.3% |
| -5.135399 | 368 | 7.3% |
| -7.797068 | 367 | 7.3% |
| -6.914864 | 364 | 7.3% |
| -2.990934 | 363 | 7.2% |
| -7.250445 | 362 | 7.2% |
| 3.597031 | 355 | 7.1% |
| 3.316694 | 355 | 7.1% |
| 0.533505 | 355 | 7.1% |
| Other values (4) | 1393 |
| Value | Count | Frequency (%) |
| -7.797068 | 367 | |
| -7.250445 | 362 | |
| -6.966667 | 350 | |
| -6.914864 | 364 | |
| -6.2 | 354 | |
| -5.45 | 346 | |
| -5.135399 | 368 | |
| -3.654703 | 370 | |
| -2.990934 | 363 | |
| -1.26916 | 368 |
| Value | Count | Frequency (%) |
| 5.54829 | 343 | |
| 3.597031 | 355 | |
| 3.316694 | 355 | |
| 0.533505 | 355 | |
| -1.26916 | 368 | |
| -2.990934 | 363 | |
| -3.654703 | 370 | |
| -5.135399 | 368 | |
| -5.45 | 346 | |
| -6.2 | 354 |
Longitude
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 109.60079 |
| Minimum | 95.323753 |
|---|---|
| Maximum | 128.19064 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 95.323753 |
|---|---|
| 5-th percentile | 95.323753 |
| Q1 | 104.75655 |
| median | 110.37053 |
| Q3 | 114.59011 |
| 95-th percentile | 128.19064 |
| Maximum | 128.19064 |
| Range | 32.86689 |
| Interquartile range (IQR) | 9.833557 |
Descriptive statistics
| Standard deviation | 8.3575928 |
|---|---|
| Coefficient of variation (CV) | 0.07625486 |
| Kurtosis | -0.1358849 |
| Mean | 109.60079 |
| Median Absolute Deviation (MAD) | 5.613975 |
| Skewness | 0.4054297 |
| Sum | 550195.96 |
| Variance | 69.849357 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 128.190643 | 370 | 7.4% |
| 116.825264 | 368 | 7.3% |
| 119.42379 | 368 | 7.3% |
| 110.370529 | 367 | 7.3% |
| 107.608238 | 364 | 7.3% |
| 104.756554 | 363 | 7.2% |
| 112.768845 | 362 | 7.2% |
| 98.678513 | 355 | 7.1% |
| 114.590111 | 355 | 7.1% |
| 101.447403 | 355 | 7.1% |
| Other values (4) | 1393 |
| Value | Count | Frequency (%) |
| 95.323753 | 343 | |
| 98.678513 | 355 | |
| 101.447403 | 355 | |
| 104.756554 | 363 | |
| 105.26667 | 346 | |
| 106.816666 | 354 | |
| 107.608238 | 364 | |
| 110.370529 | 367 | |
| 110.416664 | 350 | |
| 112.768845 | 362 |
| Value | Count | Frequency (%) |
| 128.190643 | 370 | |
| 119.42379 | 368 | |
| 116.825264 | 368 | |
| 114.590111 | 355 | |
| 112.768845 | 362 | |
| 110.416664 | 350 | |
| 110.370529 | 367 | |
| 107.608238 | 364 | |
| 106.816666 | 354 | |
| 105.26667 | 346 |
Product Name
Categorical
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| Thai Tea | |
|---|---|
| Cheese Stick | |
| Ginger Candy | |
| Coffee Candy | |
| Crackers | |
| Other values (5) |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 9.0681275 |
| Min length | 3 |
Characters and Unicode
| Total characters | 45522 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Crackers |
|---|---|
| 2nd row | Yoghurt |
| 3rd row | Choco Bar |
| 4th row | Choco Bar |
| 5th row | Yoghurt |
Common Values
| Value | Count | Frequency (%) |
| Thai Tea | 814 | |
| Cheese Stick | 620 | |
| Ginger Candy | 530 | |
| Coffee Candy | 522 | |
| Crackers | 519 | |
| Yoghurt | 488 | |
| Oat | 485 | |
| Choco Bar | 397 | |
| Potato Chip | 390 | |
| Cashew | 255 | 5.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| candy | 1052 | |
| thai | 814 | |
| tea | 814 | |
| cheese | 620 | 7.5% |
| stick | 620 | 7.5% |
| ginger | 530 | 6.4% |
| coffee | 522 | 6.3% |
| crackers | 519 | 6.3% |
| yoghurt | 488 | 5.9% |
| oat | 485 | 5.8% |
| Other values (5) | 1829 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5022 | 11.0% |
| a | 4726 | 10.4% |
| 3803 | 8.4% | |
| C | 3755 | 8.2% |
| h | 2964 | 6.5% |
| o | 2584 | 5.7% |
| r | 2453 | 5.4% |
| t | 2373 | 5.2% |
| i | 2354 | 5.2% |
| T | 1628 | 3.6% |
| Other values (17) | 13860 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33426 | |
| Uppercase Letter | 8293 | 18.2% |
| Space Separator | 3803 | 8.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5022 | |
| a | 4726 | |
| h | 2964 | |
| o | 2584 | |
| r | 2453 | 7.3% |
| t | 2373 | 7.1% |
| i | 2354 | 7.0% |
| n | 1582 | 4.7% |
| c | 1536 | 4.6% |
| s | 1394 | 4.2% |
| Other values (8) | 6438 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3755 | |
| T | 1628 | |
| S | 620 | 7.5% |
| G | 530 | 6.4% |
| Y | 488 | 5.9% |
| O | 485 | 5.8% |
| B | 397 | 4.8% |
| P | 390 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| 3803 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41719 | |
| Common | 3803 | 8.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5022 | |
| a | 4726 | 11.3% |
| C | 3755 | 9.0% |
| h | 2964 | 7.1% |
| o | 2584 | 6.2% |
| r | 2453 | 5.9% |
| t | 2373 | 5.7% |
| i | 2354 | 5.6% |
| T | 1628 | 3.9% |
| n | 1582 | 3.8% |
| Other values (16) | 12278 |
Common
| Value | Count | Frequency (%) |
| 3803 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5022 | 11.0% |
| a | 4726 | 10.4% |
| 3803 | 8.4% | |
| C | 3755 | 8.2% |
| h | 2964 | 6.5% |
| o | 2584 | 5.7% |
| r | 2453 | 5.4% |
| t | 2373 | 5.2% |
| i | 2354 | 5.2% |
| T | 1628 | 3.6% |
| Other values (17) | 13860 |
Price_y
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9684.8008 |
| Minimum | 3200 |
|---|---|
| Maximum | 18000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 3200 |
|---|---|
| 5-th percentile | 3200 |
| Q1 | 4200 |
| median | 9400 |
| Q3 | 15000 |
| 95-th percentile | 18000 |
| Maximum | 18000 |
| Range | 14800 |
| Interquartile range (IQR) | 10800 |
Descriptive statistics
| Standard deviation | 4600.7088 |
|---|---|
| Coefficient of variation (CV) | 0.47504423 |
| Kurtosis | -1.1395182 |
| Mean | 9684.8008 |
| Median Absolute Deviation (MAD) | 5200 |
| Skewness | 0.16819672 |
| Sum | 48617700 |
| Variance | 21166521 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4200 | 814 | |
| 15000 | 620 | |
| 3200 | 530 | |
| 9400 | 522 | |
| 7500 | 519 | |
| 10000 | 488 | |
| 16000 | 485 | |
| 8800 | 397 | |
| 12000 | 390 | |
| 18000 | 255 | 5.1% |
| Value | Count | Frequency (%) |
| 3200 | 530 | |
| 4200 | 814 | |
| 7500 | 519 | |
| 8800 | 397 | |
| 9400 | 522 | |
| 10000 | 488 | |
| 12000 | 390 | |
| 15000 | 620 | |
| 16000 | 485 | |
| 18000 | 255 | 5.1% |
| Value | Count | Frequency (%) |
| 18000 | 255 | 5.1% |
| 16000 | 485 | |
| 15000 | 620 | |
| 12000 | 390 | |
| 10000 | 488 | |
| 9400 | 522 | |
| 8800 | 397 | |
| 7500 | 519 | |
| 4200 | 814 | |
| 3200 | 530 |
Age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 54 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.003586 |
| Minimum | 0 |
|---|---|
| Maximum | 72 |
| Zeros | 9 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 30 |
| median | 39 |
| Q3 | 51 |
| 95-th percentile | 60 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 12.834719 |
|---|---|
| Coefficient of variation (CV) | 0.32083922 |
| Kurtosis | -0.65678737 |
| Mean | 40.003586 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.024523613 |
| Sum | 200818 |
| Variance | 164.73001 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 45 | 207 | 4.1% |
| 34 | 187 | 3.7% |
| 31 | 166 | 3.3% |
| 60 | 166 | 3.3% |
| 26 | 165 | 3.3% |
| 33 | 160 | 3.2% |
| 40 | 155 | 3.1% |
| 37 | 149 | 3.0% |
| 51 | 147 | 2.9% |
| 54 | 143 | 2.8% |
| Other values (44) | 3375 |
| Value | Count | Frequency (%) |
| 0 | 9 | 0.2% |
| 2 | 16 | 0.3% |
| 3 | 6 | 0.1% |
| 18 | 66 | |
| 19 | 104 | |
| 20 | 48 | |
| 21 | 73 | |
| 22 | 108 | |
| 23 | 83 | |
| 24 | 83 |
| Value | Count | Frequency (%) |
| 72 | 10 | 0.2% |
| 70 | 8 | 0.2% |
| 69 | 12 | 0.2% |
| 68 | 12 | 0.2% |
| 66 | 13 | 0.3% |
| 65 | 13 | 0.3% |
| 62 | 50 | 1.0% |
| 61 | 93 | |
| 60 | 166 | |
| 59 | 122 |
Gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5020 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2746 | |
| 1 | 2274 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2746 | |
| 1 | 2274 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2746 | |
| 1 | 2274 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5020 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2746 | |
| 1 | 2274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5020 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2746 | |
| 1 | 2274 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5020 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2746 | |
| 1 | 2274 |
Marital Status
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.4 KiB |
| Married | |
|---|---|
| Single |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.7615538 |
| Min length | 6 |
Characters and Unicode
| Total characters | 33943 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married |
|---|---|
| 2nd row | Married |
| 3rd row | Single |
| 4th row | Married |
| 5th row | Married |
Common Values
| Value | Count | Frequency (%) |
| Married | 3823 | |
| Single | 1197 | 23.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| married | 3823 | |
| single | 1197 | 23.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 7646 | |
| i | 5020 | |
| e | 5020 | |
| M | 3823 | |
| a | 3823 | |
| d | 3823 | |
| S | 1197 | 3.5% |
| n | 1197 | 3.5% |
| g | 1197 | 3.5% |
| l | 1197 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28923 | |
| Uppercase Letter | 5020 | 14.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 7646 | |
| i | 5020 | |
| e | 5020 | |
| a | 3823 | |
| d | 3823 | |
| n | 1197 | 4.1% |
| g | 1197 | 4.1% |
| l | 1197 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 3823 | |
| S | 1197 | 23.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33943 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 7646 | |
| i | 5020 | |
| e | 5020 | |
| M | 3823 | |
| a | 3823 | |
| d | 3823 | |
| S | 1197 | 3.5% |
| n | 1197 | 3.5% |
| g | 1197 | 3.5% |
| l | 1197 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33943 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 7646 | |
| i | 5020 | |
| e | 5020 | |
| M | 3823 | |
| a | 3823 | |
| d | 3823 | |
| S | 1197 | 3.5% |
| n | 1197 | 3.5% |
| g | 1197 | 3.5% |
| l | 1197 | 3.5% |
Income
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 369 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.6237131 |
| Minimum | 0 |
|---|---|
| Maximum | 71.3 |
| Zeros | 185 |
| Zeros (%) | 3.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4.22 |
| median | 7.72 |
| Q3 | 10.78 |
| 95-th percentile | 18.89 |
| Maximum | 71.3 |
| Range | 71.3 |
| Interquartile range (IQR) | 6.56 |
Descriptive statistics
| Standard deviation | 6.5182417 |
|---|---|
| Coefficient of variation (CV) | 0.75585094 |
| Kurtosis | 19.876027 |
| Mean | 8.6237131 |
| Median Absolute Deviation (MAD) | 3.285 |
| Skewness | 2.9748029 |
| Sum | 43291.04 |
| Variance | 42.487474 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 185 | 3.7% |
| 5.12 | 44 | 0.9% |
| 8.96 | 42 | 0.8% |
| 6.05 | 42 | 0.8% |
| 9.57 | 42 | 0.8% |
| 5.35 | 35 | 0.7% |
| 3.28 | 33 | 0.7% |
| 6.19 | 33 | 0.7% |
| 9.68 | 32 | 0.6% |
| 2.69 | 31 | 0.6% |
| Other values (359) | 4501 |
| Value | Count | Frequency (%) |
| 0 | 185 | |
| 0.06 | 10 | 0.2% |
| 0.14 | 13 | 0.3% |
| 0.18 | 12 | 0.2% |
| 0.57 | 10 | 0.2% |
| 0.74 | 9 | 0.2% |
| 0.98 | 6 | 0.1% |
| 1 | 7 | 0.1% |
| 1.12 | 12 | 0.2% |
| 1.28 | 16 | 0.3% |
| Value | Count | Frequency (%) |
| 71.3 | 8 | |
| 54.2 | 14 | |
| 35.78 | 13 | |
| 33.77 | 12 | |
| 28.23 | 13 | |
| 25.22 | 10 | |
| 23.84 | 11 | |
| 21.81 | 16 | |
| 20.81 | 17 | |
| 20.64 | 9 |
| CustomerID | Price_x | Qty | TotalAmount | StoreID | Latitude | Longitude | Price_y | Age | Income | ProductID | StoreName | GroupStore | Type | Product Name | Gender | Marital Status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CustomerID | 1.000 | -0.019 | -0.006 | -0.029 | 0.004 | 0.008 | -0.006 | -0.019 | -0.034 | -0.021 | 0.000 | 0.013 | 0.000 | 0.017 | 0.000 | 0.163 | 0.182 |
| Price_x | -0.019 | 1.000 | -0.390 | 0.467 | -0.028 | -0.024 | 0.018 | 1.000 | 0.016 | -0.002 | 1.000 | 0.027 | 0.026 | 0.015 | 1.000 | 0.041 | 0.000 |
| Qty | -0.006 | -0.390 | 1.000 | 0.586 | 0.011 | 0.002 | -0.010 | -0.390 | -0.032 | -0.025 | 0.289 | 0.027 | 0.020 | 0.027 | 0.289 | 0.018 | 0.020 |
| TotalAmount | -0.029 | 0.467 | 0.586 | 1.000 | -0.012 | -0.014 | 0.007 | 0.467 | -0.020 | -0.023 | 0.369 | 0.022 | 0.018 | 0.046 | 0.369 | 0.028 | 0.000 |
| StoreID | 0.004 | -0.028 | 0.011 | -0.012 | 1.000 | 0.594 | -0.172 | -0.028 | -0.005 | -0.003 | 0.018 | 0.912 | 0.839 | 0.747 | 0.018 | 0.017 | 0.000 |
| Latitude | 0.008 | -0.024 | 0.002 | -0.014 | 0.594 | 1.000 | -0.342 | -0.024 | 0.009 | 0.015 | 0.000 | 0.873 | 0.699 | 0.747 | 0.000 | 0.000 | 0.000 |
| Longitude | -0.006 | 0.018 | -0.010 | 0.007 | -0.172 | -0.342 | 1.000 | 0.018 | 0.013 | -0.003 | 0.009 | 0.883 | 0.784 | 0.924 | 0.009 | 0.023 | 0.000 |
| Price_y | -0.019 | 1.000 | -0.390 | 0.467 | -0.028 | -0.024 | 0.018 | 1.000 | 0.016 | -0.002 | 1.000 | 0.027 | 0.026 | 0.015 | 1.000 | 0.041 | 0.000 |
| Age | -0.034 | 0.016 | -0.032 | -0.020 | -0.005 | 0.009 | 0.013 | 0.016 | 1.000 | 0.613 | 0.003 | 0.000 | 0.000 | 0.000 | 0.003 | 0.188 | 0.638 |
| Income | -0.021 | -0.002 | -0.025 | -0.023 | -0.003 | 0.015 | -0.003 | -0.002 | 0.613 | 1.000 | 0.012 | 0.020 | 0.000 | 0.000 | 0.012 | 0.142 | 0.410 |
| ProductID | 0.000 | 1.000 | 0.289 | 0.369 | 0.018 | 0.000 | 0.009 | 1.000 | 0.003 | 0.012 | 1.000 | 0.021 | 0.023 | 0.000 | 1.000 | 0.067 | 0.000 |
| StoreName | 0.013 | 0.027 | 0.027 | 0.022 | 0.912 | 0.873 | 0.883 | 0.027 | 0.000 | 0.020 | 0.021 | 1.000 | 0.957 | 0.999 | 0.021 | 0.000 | 0.000 |
| GroupStore | 0.000 | 0.026 | 0.020 | 0.018 | 0.839 | 0.699 | 0.784 | 0.026 | 0.000 | 0.000 | 0.023 | 0.957 | 1.000 | 1.000 | 0.023 | 0.000 | 0.000 |
| Type | 0.017 | 0.015 | 0.027 | 0.046 | 0.747 | 0.747 | 0.924 | 0.015 | 0.000 | 0.000 | 0.000 | 0.999 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| Product Name | 0.000 | 1.000 | 0.289 | 0.369 | 0.018 | 0.000 | 0.009 | 1.000 | 0.003 | 0.012 | 1.000 | 0.021 | 0.023 | 0.000 | 1.000 | 0.067 | 0.000 |
| Gender | 0.163 | 0.041 | 0.018 | 0.028 | 0.017 | 0.000 | 0.023 | 0.041 | 0.188 | 0.142 | 0.067 | 0.000 | 0.000 | 0.000 | 0.067 | 1.000 | 0.018 |
| Marital Status | 0.182 | 0.000 | 0.020 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.638 | 0.410 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.018 | 1.000 |
| TransactionID | CustomerID | Date | ProductID | Price_x | Qty | TotalAmount | StoreID | StoreName | GroupStore | Type | Latitude | Longitude | Product Name | Price_y | Age | Gender | Marital Status | Income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | TR11369 | 328 | 01/01/2022 | P3 | 7500 | 4 | 30000 | 12 | Prestasi Utama | Prestasi | General Trade | -2.990934 | 104.756554 | Crackers | 7500 | 36 | 0 | Married | 10.53 |
| 1 | TR16356 | 165 | 01/01/2022 | P9 | 10000 | 7 | 70000 | 1 | Prima Tendean | Prima | Modern Trade | -6.200000 | 106.816666 | Yoghurt | 10000 | 44 | 1 | Married | 14.58 |
| 2 | TR1984 | 183 | 01/01/2022 | P1 | 8800 | 4 | 35200 | 4 | Gita Ginara | Gita | General Trade | -6.966667 | 110.416664 | Choco Bar | 8800 | 27 | 1 | Single | 0.18 |
| 3 | TR35256 | 160 | 01/01/2022 | P1 | 8800 | 7 | 61600 | 4 | Gita Ginara | Gita | General Trade | -6.966667 | 110.416664 | Choco Bar | 8800 | 48 | 1 | Married | 12.57 |
| 4 | TR41231 | 386 | 01/01/2022 | P9 | 10000 | 1 | 10000 | 4 | Gita Ginara | Gita | General Trade | -6.966667 | 110.416664 | Yoghurt | 10000 | 33 | 0 | Married | 6.95 |
| 5 | TR51675 | 283 | 01/01/2022 | P10 | 15000 | 1 | 15000 | 5 | Bonafid | Gita | General Trade | -7.250445 | 112.768845 | Cheese Stick | 15000 | 19 | 1 | Single | 0.00 |
| 6 | TR54287 | 51 | 01/01/2022 | P8 | 16000 | 2 | 32000 | 2 | Prima Kelapa Dua | Prima | Modern Trade | -6.914864 | 107.608238 | Oat | 16000 | 36 | 0 | Married | 7.95 |
| 7 | TR67455 | 49 | 01/01/2022 | P5 | 4200 | 3 | 12600 | 13 | Buana | Buana | General Trade | -1.269160 | 116.825264 | Thai Tea | 4200 | 44 | 1 | Married | 13.48 |
| 8 | TR73041 | 222 | 01/01/2022 | P9 | 10000 | 6 | 60000 | 4 | Gita Ginara | Gita | General Trade | -6.966667 | 110.416664 | Yoghurt | 10000 | 45 | 0 | Married | 15.03 |
| 9 | TR7596 | 270 | 01/01/2022 | P7 | 9400 | 2 | 18800 | 14 | Priangan | Priangan | Modern Trade | -5.450000 | 105.266670 | Coffee Candy | 9400 | 49 | 1 | Married | 8.81 |
| TransactionID | CustomerID | Date | ProductID | Price_x | Qty | TotalAmount | StoreID | StoreName | GroupStore | Type | Latitude | Longitude | Product Name | Price_y | Age | Gender | Marital Status | Income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5010 | TR1380 | 266 | 31/12/2022 | P9 | 10000 | 3 | 30000 | 11 | Sinar Harapan | Prestasi | General Trade | 0.533505 | 101.447403 | Yoghurt | 10000 | 72 | 1 | Married | 4.72 |
| 5011 | TR31574 | 212 | 31/12/2022 | P7 | 9400 | 2 | 18800 | 13 | Buana | Buana | General Trade | -1.269160 | 116.825264 | Coffee Candy | 9400 | 36 | 0 | Married | 7.96 |
| 5012 | TR37544 | 395 | 31/12/2022 | P3 | 7500 | 2 | 15000 | 9 | Lingga | Lingga | Modern Trade | -3.654703 | 128.190643 | Crackers | 7500 | 28 | 0 | Married | 3.39 |
| 5013 | TR38129 | 253 | 31/12/2022 | P3 | 7500 | 5 | 37500 | 4 | Gita Ginara | Gita | General Trade | -6.966667 | 110.416664 | Crackers | 7500 | 37 | 0 | Married | 4.32 |
| 5014 | TR45899 | 232 | 31/12/2022 | P6 | 18000 | 1 | 18000 | 9 | Lingga | Lingga | Modern Trade | -3.654703 | 128.190643 | Cashew | 18000 | 62 | 0 | Married | 7.32 |
| 5015 | TR54423 | 243 | 31/12/2022 | P10 | 15000 | 5 | 75000 | 3 | Prima Kota | Prima | Modern Trade | -7.797068 | 110.370529 | Cheese Stick | 15000 | 38 | 0 | Married | 3.34 |
| 5016 | TR5604 | 271 | 31/12/2022 | P2 | 3200 | 4 | 12800 | 9 | Lingga | Lingga | Modern Trade | -3.654703 | 128.190643 | Ginger Candy | 3200 | 29 | 0 | Married | 4.74 |
| 5017 | TR81224 | 52 | 31/12/2022 | P7 | 9400 | 6 | 56400 | 9 | Lingga | Lingga | Modern Trade | -3.654703 | 128.190643 | Coffee Candy | 9400 | 37 | 0 | Married | 3.73 |
| 5018 | TR85016 | 18 | 31/12/2022 | P8 | 16000 | 3 | 48000 | 13 | Buana | Buana | General Trade | -1.269160 | 116.825264 | Oat | 16000 | 47 | 0 | Married | 13.60 |
| 5019 | TR85684 | 55 | 31/12/2022 | P8 | 16000 | 1 | 16000 | 6 | Lingga | Lingga | Modern Trade | -5.135399 | 119.423790 | Oat | 16000 | 34 | 1 | Married | 8.44 |